Everyone’s Voice Matters: Quantifying Annotation Disagreement Using Demographic Information

نویسندگان

چکیده

In NLP annotation, it is common to have multiple annotators label the text and then obtain ground truth labels based on major annotators’ agreement. However, are individuals with different backgrounds various voices. When annotation tasks become subjective, such as detecting politeness, offense, social norms, voices differ vary. Their diverse may represent true distribution of people’s opinions subjective matters. Therefore, crucial study disagreement from understand which content controversial annotators. our research, we extract five datasets, fine-tune language models predict disagreement. Our results show that knowing demographic information (e.g., gender, ethnicity, education level), in addition task text, helps To investigate effect demographics their level, simulate combinations artificial explore variance prediction distinguish inherent controversy perspective. Overall, propose an innovative mechanism for better design process will achieve more accurate inclusive systems. code dataset publicly available.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Detecting Missing Annotation Disagreement using Eye Gaze Information

This paper discusses the detection of missing annotation disagreements (MADs), in which an annotator misses annotating an annotation instance while her counterpart correctly annotates it. We employ annotator eye gaze as a clue for detecting this type of disagreement together with linguistic information. More precisely, we extract highly frequent gaze patterns from the pre-extracted gaze sequenc...

متن کامل

Quantifying disagreement in argument-based reasoning

An argumentation framework can be seen as expressing, in an abstract way, the conflicting information of an underlying logical knowledge base. This conflicting information often allows for the presence of more than one possible reasonable position (extension/labelling) which one can take. A relevant question, therefore, is how much these positions differ from each other. In the current paper, w...

متن کامل

Crowdsourcing Disagreement for Collecting Semantic Annotation

This paper proposes an approach to gathering semantic annotation, which rejects the notion that human interpretation can have a single ground truth, and is instead based on the observation that disagreement between annotators can signal ambiguity in the input text, as well as how the annotation task has been designed. The purpose of this research is to investigate whether disagreement-aware cro...

متن کامل

Disagreement and Information Collection

This note shows that disagreement, in the sense of differing priors, may increase the incentives to collect information when two agents work on a joint project. The reason is that each agent believes that new data will confirm his own beliefs and thus ‘convince’ the other agents to do what the focal agent thinks is right.

متن کامل

Voice quality assessment using phase information: Application on voice pathology

One of the most important human abilities is speech along with hearing. Speech is the primary way in which we attune to the society. Our voice can uncover several information about us to other people. It reveals our energy level, our emotions, our personality and our artistry. Voice abnormalities may cause social isolation or may create problems in the professional field. Due to this significan...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i12.26698